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E»  Wo  Barankin 


io  INTRODUCTION 


Consider  a real  valued  function  f on  a domain  Do  We  assume  that  D 
is  a finite  set  of  points,  say  N in  number 0 Let  M - max  f(x),  and 


suppose  f takes  on  the  value  M at  exactly  V points;  the  set  of  these 
points  will  be  denoted  by  K. 

Far  from  looking  upon  our  assumption  of  finiteness  of  D as  a 
specialization  we  consider  that  it  reflects  the  real  computing  situa- 
tion0  Consequently,  our  results  pertain  also  to  situations  in  which  D 
is  taken  to  be  a continuum  and  in  which  continuous  distributions  are 
employed;  it  is  required  only  to' recognize  the  discontinuous  distribu- 
tions to  which  the  given  continuous  distributions  are  approximations,. 

It  is  our  purpose  here  to  study  stochastic  processes 


more  particularly,  to  study  the  implications  of  such  a process  for 
the  determination  of  M when  the  realized  values  x^  of  the  random 
variables  X^  are  submitted  to  computation  of  f~  values.  To  be  precise, 
we  suppose  that  after  each  stage  of  the  process,  say  the  i-th,  if  the 
point  x^  £ D is  the  realized  value  of  X^,  we  compute  f(x^).  Thus, 

The  preparation  of  this  paper  was  sponsored  (in  part)  by  ARDCo 
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where  each  X^  is  a random  variable  in  D; 
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after  the  k-th  stage,  if  the  realized  values  of  Xp  X2,  ° ° ° , X^  are 
x]_s  x25  °e<>  s x}c5  respectively,  then  we  have  before  us  the  f-  values 
f(x^),  fCx^,  ->o°  5 f(x^)  0 This  represents  a certain  amount  of  infor^- 
mation  about  the  function  fj  let  us  denote  it  by  cfi  ^°(x^5  Xg,  *°°  9 x^.)  „ 
It  may  be  that  some  general  knowledge  concerning  f can  be  coupled 
with  the  information  J ^°(x^5  x2,  °°°  9 x^)  t°  give  still  a greater 
amount  of  specific  useful  information  after  the  k~th  stage.  Accordingly, 
we  shall  take  the  k-th  stage  information  scheme  to  be  in  general  as 
follows,  in  addition  to  J , °t 


V(xr 


x 2 » 


, xk)  = 


a known  number 


> V 4 


max 

i=ls23° 


f(x^)  such  that  it  is  known 


that  M ^ Tk(xi5  x2, 


xJ 


V(xl>  x2. 


x^)  % a known  subset  of  D,  Ak+^(x^5  x2,  • ° ° s x^,)  — 
|x^,  Xg,  •*°  3 x^j  such  that  for  each 
x t \+i^xl3  x2'5  0°°  5 xk^5  although  the 
number  f(x)  may  not  be  known  exactly,  it 
is  known  that  f(x)  - T^Cx^,  x^,,  000  9 x^)  ; 


^k  'xls  x23  °“°  s xk)  i a known  subset  of  Ds  Bk+i^xis  x2,  -sxk)) 

disjoint  from  Ak+^(x^,  x2,  0 0 • ? x^) , such 
that  for  each  x £ Bk+]_(x-p  X23  000  s x^)  the 
number  f(x)  is  not  known  exactly, but  it  is 
known  that  f(x)  > T^Cx^,  X2,  °°°  , x^) 


' 
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The  special  case  of  this  scheme , wherein 


T,(x-,s  x?3  °°°  j xj  = max  f(x.)  3 

i-ls2,-..,k 


^ ■ { 


Xf3  X2  3 ° ° ° 3 ^ 5 


and 


Bk+l^xl?  *2; 


A 


(where  A denotes  the  empty  subset  of  D)  is  that  in  which 
<9^°(xi3  Xgs  0 0 0 3 xk)  is  actually  the  total  information  at  hand. 

On  the  basis  of  the  above  information  scheme,  an  immediately 
suggested  estimated  value  for  M is  T^Xx-p  X23  0 0 • , x^)  „ There  are 
alternatives  to  this]  for  example,  the  maximum  of  the  function  on  D 
obtained  by  a specified  extrapolation  of  the  known  values  of  f on 


However,  in  this  article  we  shall  fix  our 


attention  on  the  estimator  i = 1,  2,  [ „ We  are  consequently 

committed  to  study  the  stochastic  convergence  of  the  sequence  of 
random  variables  J - {^(X^),  X2) 

Of  course,  we  have  for  all  k = ls  2, 


• 0 ° , and  all 


:]_s  ^2 : 


s xk3  xk+l5 


(1.1) 


Tk(xi,  x23  ...  3 xk)  ^ M 


and 


(l02)  ^k+l^xl5  X2S  °°°  s ^k+l^  ^k^xls  x2- 


xk) 


o o 


* 


h 


From  (l„2)  it  follows  that 


(le3) 


^k+2^xl5  *2 5 


Vl5  3 ^k+l^^l 


j)  2 


xk) 


The  stochastic  process  »&  that  will  be  at  work  producing  points 
of  D,  will  be  conditioned  by  the  computer's  behavior „ This  condition- 
ing is  effected  upon  the  conditional  probabilities 


(loU) 


{= 


X 


■k+1 


Xi  = 


1,  2, 


5 x & D « 


The  conditioning  is  realized  through  the  choice , by  the  computer,  of 
random  devices  which  (he  guesses)  will  secure  desirable  analytic  condi- 
tions on  the  conditional  probabilities  (l.h)o  Just  what  such  desirable 
analytic  conditions  are  will  be  determined  by  studying  the  implications 
of  various  analytic  conditions  on  (l.h)  in  the  light  of  our  designated 
criteria  for  the  process  <,  Our  criteria  in  this  article  are  ‘the 
following:  large  values  for  the  probabilities 


(i.s)  P{VX!»  v ••• . V " H ) > k ■ 1>  2>  > 

and  weak  and  strong  stochastic  convergences  of  ^ to  M.  Thus^  the 
problem  that  will  occupy  us  is  this:  to  find  analytic  conditions  on 

(l.U)  which  will  give  large  values  to  the  numbers  (l,f>)  and  which  will 
insure  weak  and  strong  stochastic  convergences  of  J to  M„ 

Now,  there  is,  of  course,  a limitation  on  the  desirable  analytic 
conditions  for  which  we  may  seek,  a limitation  that  must  be  observed  if 
the  computer  is  to  be  able  to  realize  these  conditions  in  the  choices 
(of  random  devices)  that  are  open  to  him*  namely,  the  fulfillment  of 


' 


the  conditions  must  be  feasible  under  the  'information  at  hand  at  each 


stage  o To  illustrate  this,,  consider  that  after  any  stage 9 say  the  k-th, 
it  would  be  hiphly  desirable  to  select  random  devices  to  secure 


(where  Ak+1  denotes  the  complement  of  A^+^  in  D)  we  do  not  have  enough 
information  to  locate  a subset  of  Ks  therefore  not  enough  information 
to  secure  (l06)„  It  will  be  seen  in  our  discussion  that  our  conditions 
are  feasible  in  the  sense  here  indicated „ 

Our  results  are  stated  in  Theorem  1 (end  of  section  2)  and  in 
Theorem  2 (end  of  section  .3)  . And  in  section  h we  discuss  the  matter 
of  improving  these  results. 

2 0 SUFFICIENT  CONDITIONS  FOR  STRONG  CONVERGENCE  OF  J”  TO  M 

As  of  each  k-th  stage , the  computer  is  motivated  in  two  ways 
toward  putting  feasible  analytic  conditions  on  the  probabilities 
(loU)o  First c,  if  neither  of  the  situations  (l „ 7)  holds,  he  wishes 


whatever  be  Xp  ° • • , x^o  But  unless 


o ® o 


Ak+l^xls  x2>  000  5 XjP  = 0 


or 


to  insure  a positive  (k+l)-th  stage  probability  to  each  point  of 


. 
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x^)|  this 5 by  way  of  not  ruling  out  the  possibility 
of  any  such  point  being  realized  in  the  (k+l)-th  stage,  since  (accord- 
ing to  the  information  at  hand)  f may  have  the  value  M only  at  one  of  these 
complementary  points „ Secondly,  he  wishes  to  assign  high  probabilities 
to  sets  concerning  which  he  has  been  led  to  believe  that  in  them  the 
values  of  f are  higher  than  the  f~  values  he  has  already  discovered,, 

The  conditions  we  shall  put  on  the  probabilities  (ijj.)  will  embody 
these  motivations o 

But  first,  consider  (l*7)  . If  either  of  these  situations  holds, 
and  only  if  one  of  them  holds,  we  know  exactly  which  points  constitute 
K0  And  it  is  then  feasible  to  make  the  probability  of  K unity c Thus, 
we  shall  take 


k+1 


( x1 o ^2 , 


(2.1)  p(\+1£  K | X.  = x.,  i = ls  2S  •••  3 kj  - 1 if  (1*7)  holds  . 

Now  consider  the  first  motivation„  One  feasible  realization  of 
this  is  in  terms  of  a chosen  positive  number  < 1,  and  is  as  fol- 

lows: Whatever  be  x^s  Xg,  000  s if  (l07)  does  not  hold,  we  take 


(2.2) 


^ {^k+l  **  x ® ^i  xis  ^ 000  , 


x £ ^k+l^l5  x2 3 9 xk^ 


k for  each 


The  nature  of  this  condition  is  such  as  to  imply  a sharper  restriction 
on  ck+1.  Let  p(x^3  X2.,  000  s x^)  denote  the  nuntoer  of  points  in 
^k+l^xl5  X2,  000  , x^.)  o (This  number  is  known  under  the  ( information 
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at  hand).  Thens  according  to  (2  ,2) 5 the  conditional  probability  of  the 
set  Ak+1(xls  X23  000  s x^.)  is  greater  than  or  equal  to 

[N-/(x15  x2,  <>«■>  , xk)]  ck+1  , 

and  therefore  this  Humber  must  be  less  than  or,  equal  to  unitys  whence 


1 


"k+l  “ /?(x15  x?5 


Moreover s since  ek+^  is>  taken  to  be  independent  of  Xp  x2S 
we  must  have 


s 


(2.3) 


'k+l  N ~ min 

Xl3X23°,oXk 


^7^7  000  3 77" 


¥e  cannot  say  anything  more  explicit  about  the  number 

0 s xk)  without  further  assumptions  concerning 

s „ As  an  example 9 if 


min  j0(x-p  x2s 
xi5x2s»-sxk 


the  sets  Ak+1(xp  x2, 


Ak+l^xls 


^2  • 


V - fxls 


X, 


23 


>*} 


for  all  Xp  x2j 
and  we  get 


xk3 


then 


x, 


13 


min 

x23  ° ° ° ?xk 


X2  3 ° ° ° 3 ^3 


(2  oh) 


Of  course  3 the  right-hand  side  of  (2  0U)  is  not  greater  than  the  right- 
hand  side  of  (2„3)s  consequently 5 if  we  required  the  fulfillment  of 
(2  o10  3 then  (2,3)  would  be  automatically  satisfied. 


8 


We  now  go  to  the  second  motivation,,  And  this  leads  us  immediately 
to  favor  at  least  the  set  Xg,  »<><>  , x^)  „ That  is,  if  this  set 

is  not  empty,, then  we  know  that  the  f-  values  at  the  points  in  it  are 
greater  than  the  largest  value  of  f so  far  discovered,  and  therefore 
we  want  to  give  the  ooints  of  B^-^  a "better  than  minimal  chance"  of 
being  realized,,  If  the  second  of  the  two  situations  (le?)  holds,  then 
we  know  that  B^^  (xp  Xg,  000  , x^)  “ K,  and  we  proceed  according' to 
(2cl)  „ But  if  (10?)  does  not  hold,  we  do  not  wish  to  throw  all,  the 
conditional  probability  into  B^p  Bor,  according  to  our  incomplete 
information,  the  value  M may  very  well  be  taken  on  only  in 
•^k+1  U ®k+l°  Bhus,  we  shall  want  only  this  much:  to  give  each 

point  of  B^.+^  a probability  - a positive  number,  say  b^.+p  this  number 
being  greater  than  e^p 


(205) 


bk+l  > ®k+l 


1 2 


Thus  do  we  favor  the  points  of  B^.+^  in  particular,  over  the  points 
of  in  general  o 

But-- we  repeat— K may  not  intersect  B^+^  at  alle  The  computer 
may  have  some  feeling  about  this,  not  at  all  based  on  the  formal  in- 
formation at  hando  This  is  to  say  that  he  may  feel  justified  in 
making  a guess  over  and  above  any  logical  justification  by  his  infor- 
mation „ We  want  to  allow  for  such  guessing,  and  we  do  so  as  follows: 
there  shall  be  designated  a set  F^+1  (xp  0 * 0 $ x^)  s disjoint  from 

B^+p  such  that  each  of  its  points , as  well  as  the  points  of  B^p 
shall  be  given  a conditional  probability  - bjp^  o 
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We  have  now  carried  out  fully  the  second  motivation,,  and  it  has 
led  to  the  following  formal  prescription;  whatever  be  x^9  3 x^, 

if  (lo7)  does  not  hold.  we  take 


pfxk+1  = x I X±  - xls  1,  2, 


(2.6) 


“ s k|  - b^..^  for  each 
x ^ ^k-tl^xls  x2 3 ° ° ° 5 


where 


^k+l^xls  x2: 


®k+l^xl5  x25  0 0 0 9 -^k^  U ^k+i^l3  x2 


V 


If  t(x-^5  Xg'j  000  a Xy)  denotes  the  number  of  points  in  the  set 


*\+l  xls  x2 9 


(2.7) 


V\  —■ 

Dk+1 


x^)  then  we  must  have 
1 


max  flx13  X-2 , 000  s x^J 


y -y  o a o y 

ty2-*-2s  5 K 


It  is  to  be  remarked  that  if  is  empty  then  might  very 

well  include  the  set 


(2.8) 


^k+hxl5  x2 


xk)  = 


x is  known  to  have  the 
property  that 

f(x)  = Tk(xls  x2?  °“°  » xi^s 

< x E.  x29  ° ° 0 s xk^ 

or  v 

x is  not  known  to  have  the 
property  that 

f(x)  < T^(x-^s  X2 , 0 • 0 s xk) 
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This  would  represent  the  guess  -that  M may  already  have  been  achieved,, 

We  shall,  in  fact,  make  this  inclusion  assumption!  that  is,  if 
^k+l^xl5  x23  000  3 xk^  — emPt3r->  then  we  take 

(2e9)  ^k+l^xls  x23  00°  3 ~ Ak+l^xl3  x2 5 ” ° ° 3 x k ^ ° 

We  have  how  finished  our  specification  of  a feasible  set  of  analytic 
conditions  on  the  conditional  probabilities  (l„U),  and  we  are  ready  to 
study  the  implications  of  these  conditions „ We  direct  our  attention  to 
obtaining  a lower  bound5  in  terns  of  the  constants  bj,  and 
k - 2,  3S  0 0 0 5 s for  the  comprehensive  probability 
P {Ts(X15  X23  0 0 0 3 Xg)  - M}  . Let  us  define  another  estimator 
{Tk°,  k = 1,  2,  »co  | as  follows? 

(2o10)  Tjc°(xis  x2'5  ° ” ° , x^)  - max  f(x) 

y»jy  *y*  o o o ‘V’ 

By  virtue  of  the  inclusion  X23  000  , x^j  S Ak+l^xl3  X2S  0 0 0 3 3 

we  have  always 

(2-U)  Tk°(xi3  x23  °°°  3 V £ Tk(xl3  X23  3 ^ ° 


Snd  therefore 


(2.12) 


P{Ts(Xl>  X2>  > Xk)  “M} 


X2,  •••  , Zs)  - M}  , s - 1,  2 


- 
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Hence , we  shall  have  a lower  bound  for  the  left-hand  side  of  (2„12)  when 
we  have  found  a lower  bound  for  the  right-hand  side  „ 

Now,  the  right-hand  side  of  (2„12)  is  equal  to 
1-  p{Ts°(X1s  X2s  •••  , Xs)  < m]  ( cf o (1.1)  and  (2.11)),  apd  there- 
fore we  may,  and  shall  actually  work  first  toward  an  upper  bound  for 

(2.13)  P (ts°(X13  X2,  •••  , Xs)  < M}  = P (f(X±)  < M,  i = 1,  2,  ..o  , s} 

Toward  this  end  we  set  down  the  following  equalities  and  inequali- 
ties,  derived  from  (2  d) , (2.2),  (2  „6) , and  (2„9). 

(2.1U)  P {f(Xk+1)  < M | Xj.  = x±,  i - 1,  2,  •••  , k]  = 0 if  (1.7)  holds 
If  (l„7)  does  not  hold,  and  if  K f|  Gk+^(x-^,  x2,  000  , x^)  fit  , 

then 

(2.1?)  P {f(Xk+1)  < M | X±  = xi5  i = 1,  2,  000  ? k]  4 1 _ bk+1  . 

If  (I.?)  does  not  hold,  and  if  K D Gk+^(x^s  x2,  0 • 0 , x^,)  - A 3 

then, 

(2.16)  P {f(Xk+1)  < M | X±  = x±,  i - 1,  2,  • ••  , k } ^ 1 - V ck+1  . 

I 

These  three  assertions,  for  k - 2,  3,  000  , s - 1,  together  with 

P { f (X2  ) < M | - 1 , 


are  now  applied  to  the  relations 
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P (f(Xi)  < M,  i = 1,  2,  °°°  , k + l)  ® 


Xp{Xi=Xi?i  = ls23‘><’«3kj>  3k=2333',05s-l 


And  the  result  in  general  is  the  followings 

p{f(Xi)  < M,  i = 1,  2,  ...  , s}  ^ 

(2.18) 

o 

TT  (max  [1  - b .,  1 - vc,]),  s = 2,  3S  000  , 

j=2  3 J 

whence 

p{Ts°(xi>  x2>  •••  xs5  =M}  - 

(2.19) 

s 

1 “ TT  (max  [!  “ b 1 " s = 2,  3,  •••  . 

j=2  J J 

¥e  have  said  that  (2„l8)  is  the  consequence  of  our  calculations 
in  general.  One  finds,  in  carrying  out  those  calculations,  that  the 
following  hold: 

If  for  all  k = 2,  3,  ° 00  , and  all  x^,  • • • , 

Gk+l^xl5  X2J  °°°  5 con~baj-ns  a point  of  K,  then 


(2a20) 


P {ts°(X1s  x2,  •••  , Xs)  = m}  * 

1 - fr  (1  - b ),  s = 2,  3,  •••  . 

5S2  j 
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If  for  all  k - 2„  3 


^k+l^xls  x 2 


3 ° ° ° s amd  all  x^,  Xg  9 


x^)  contains  no  point  of  Ks  then 


(2  e2l) 


{Ts0(x 


Y 

Is  2J 


(1  - Vc  „) 


*Xs 


) - to  j 


3 c, 


The  import  of  these  special  results  is  this  0 Suppose  the  computer  has 
great  confidence  in  his  sets  * 0 ° » x^.)5  that  is5  confidence 

that  they  contain  points  of  K0  Then  he  will  make  the  numbers  bj  big 
(iee05  very  close  to  l)  with  the  result  that  they  will  be  much  bigger 
than  the  corresponding  numbers  Then  if5  although  this  is  not 

within  his  information,  the  sets  ^^(x^  X2,  °°°  5 x^.)  do  in  fact  all 
contain  points  of  K(i0e0,  his  confidence  in  these  sets  is,  unbeknown 


to  him9  actually  verified),  the  probabilities  P jT^0 (X^9  X 


2? 


,)  =MI 


have  the  larger  lower  bounds  1 - Jj  (l  - b.)  . On  the  other  hand,  if t 

j-2  J 

in  facte,  none  of  the  sets  G^+-^  contains  a point  of  K (in  spite  of  his 

confidence)  then  we  can  only  assert  (2e2l),  with  the  smaller  lower 
s 

bounds  1 - "f[  (1  - Vc  .) » 

Now,  a necessary  and  sufficient  condition  that  the  products  in 


(2  0l8)  and  (2.19).  for  s = 2,  3, 


tend  to  0 is  that  the  series 


oo 


min  [bjS  Vc  .] 


be  divergento  This  condition,  which,  through  (2„19)  and  (2„12)  is 
seen  to  be  a sufficient  condition  for  weak  convergence  of  Cf  to  M,  is 


iu 


however,  not  a feasible  condition,  for,  we  do  not  know  the  value  of  V . 
However,  by  virtue  of  (2  £)  we  have 


s s 

£ min  [b  , yc  ] > £ e , s = 2,  3,  •••  • 

j*: 2 3 J j-2  J 


Hence,  we  can  take  the  feasible  condition  that  2 c . be  divergent « This 


has,  in  addition,  the  consequence  (again  through  (2,5))  that  also  2 b 


3 


is  divergent,  Thus,  the  convergence  to  1 of  the  right-hand  sides  of 
all  three  inequalities  (2.19),  (2„2l),  and  (2.20 ) is  assured. 

Hence,  the  divergence  of  2 c . insures  the  weak  convergence  of  j 
to  M in  any  case  * But  more  than  this5  it  insures  the  strong  convergence 
of  cf  to  Mo  This  is  a consequence  of  (1.2)  . That  is,  by  virtue  of 
(1.2)  we  have,  for  each  s - 1,  2,  ° ° » , 


(2.22) 


|(x15  Xg; 


«•  •)  £ DXDX  o,o  | Tc(x15  Xg,  ° 8 ° 

» • “)  2 DX  D X ° ° ° j lim  T (x^,  x0 , 

r— * oo  r 


> xs) = Mj 

° * ° , xr)  = m] 


which  implies  that,  for  each  s s 1,  2,_°..  , 

P ( lim  Tr(Xls  X2,  •••  , X ) = M j * 

(2.23) 

p{ts(XiS  X2S  •••  , Xs)  =.  m}  , 


so  that  weak  convergence  implies  strong  convergence. 

If  we  combine  (2,12)  with  the  inequalities  (2,19),  (2,20),  and 
(2  ,2l)  to  writes  in  general 


IS 


(2e2b) 

1 - 


p{ts(Xi»  X2,  , xs)  -«}  * 

S 

TT  (max[l  - b.,  1 - VcJ),  s s 25  39 
j=  2 J J 


9 


if  for  an 

k - 2,  3, 

o o o 

9 . 

and 

all  x-^ 

.9  ^2  9 

9 • O 

3 Xk9 

Gk+l^xl-9  X2S 

contains  a 

point  of 

then 

P < 

iVxi> 

x2J 

3 

xj  = 

s 

»] 

f - 

(2  025j) 

s 

i-  rr  a- 

b,) 

9 S = 

2S  3 5, 

c . 

O 

j2 

* 2 

3 

if  for  all 

k = 25  3S 

3 . 

and 

all 

9 X2  9 

o © o 

3 Xk3 

Gk+l^xls  *2 3 

contains  no  point  of 

K9  then 

(2  02 6) 

pl 

o o o 

$ 

xs)  - 

M 

j,  'a 

s 

1 - TT  (1  - VC,),  s - 2,  3,  •••  I 
3-2  0 


then  we  can  state  our  result  as  follows? 

THEOREM  10  If  the  sto ehasti c process  $ satisfies  the  conditions 
(20l),  (2  02)j  {2  S)  9 (2  06)s  and  (2„9)5  then  the  inequalities  (2  «,2 U)  3 
(2  02E>)  9 and  (2.26)  hold  in  the  respective  cases  described „ And  in  any 
ease  a sufficient  condition  that  Cf  - {T  g(XiS  Xg,  •••  , Xg)  J, 


s - 1,  2S 


°]  converge  almost 


to  M is  that  the  series  2 c . be 

0 


divergent 


This  theorem  actually  states  a weaker  convergence  result  than  we 
have  establishedo  That  iss  (l02)  and  consequently  (2  c22)  and  (2C23) 
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hold  also  with  T°  in  place  of  T„  Therefore^  through  the  inequalities 
(2019)s  (2„20)s  and  (2  02l)5  the  divergence  of  2 implies  almost  sure 
convergence  of  ° ” ° > Xg)5  s = 1,  2S  » • ° "j  to  M0 

And  this  is  the  stronger  result  by  virtue  of  (2 cll)  and  (l„l)  „ However s 
our  attention  is  concentrated  on  the  random  variables  T (X^,  °°°  s xs 

and  this  being  the  case5  our  detailed  results*,  namely  the  inequalities 
(2„2U),  (2  e25»)  $ and  (2  026)  are  in  fact  the  weaker  for  having  come  about 
by  our  taking  advantage  of  (2o12)0  We  shall  say  more  about  this  in 
section  I4, 


3o  NECESSARY  CONDITIONS  FOR  STRONG  CONVERGENCE  OF  J TO  M 


In  this  section  we  shall  obtain  upper  bounds  for  the  probabilities 
P ^T^(X^3  0 , Xg)  s M*j  5 s ~ 13  25  °°°  3 and  from  them  obtain  a 

necessary  condition  for  strong  convergence  of  CT  to  M, 

Let  us  define 


(3  ol) 


V 


|x  6 A]c+1(x1,  x2,  •••  , xfc)  I f(x)  is  known  j- 


and 


(3<>2)  a^+1 

rain  p{xk+i  8 Ak+l^xl*  x2 s °!"  * I xi  = V 1 = 2>  000  j k 

X]_SX2  } ° ° ° 


k s 1,  2,  ••• 
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It  is  to  be  noted  that  the  are  feasible  parameters!  that  is5  the 
computer  can  control  them  (to  within  the  requisite  guessing)  . 

Now  let  us  direct  our  attention  to  the  following  important  subsets 

of  Ttt 

(33)  Hk(xls  •••  , xka=1)  =|  xk  £ D|Tk(x15  ^5  ° • ° , < M | . 


For  k = 1 we  set  also 


(3, 


^ - [x  2 D | T1(x)  < mJ> 


By  virtue  of  (l„2)  we  have 


(3.S) 


M \ e Hk(X1S  X2: 


5 Xk“l^  3 ^en 


xk-=l  5 Hk-l^l5  *2  9 °os  5 xk=2^ 


An  equivalent  statement  is  the  followings 

«N 

if  £ Hk.1(x1,  x2,  •••  , ^_2) 

^k^xl3  x2; 


(3.6) 


then 


1 9 000  j xk“l^  A 


A consequence  of  this  is  that  if  Hk=^(x-^a  X2^  °”°  9 xk„2^  = A 3 then 
for  each  xk=1  E D also  H^x-^  X25  ° « 0 9 xk_]_^  ” A „ 

From  (3a1?)  or  (3„6)  we  get  the  following  identity! 


' 


■ 
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(3  o?) 


3^  £ 

XpS  Hg  (x^)  o o o 

xs-l£  HS“1^X15X25  ° ’ °sxs-2^ 


{ts(x1s  X2,  ...  , Xs)  <m}  - 

P(XS  a Hg(XpX2>  ■>003XS«1)  lXl  s V 1 = Vs000,3-!} 


P X 


s-1 


* Xs=ilxi  = x±s  i = 1,2,°°°,  S - 2} 


p{x2  ~ ^2  I xi  = xl|  ^ 

pfxi  - Xj}  . 

The  importance  of  the  sets  is  that  they  provide  this  exact  expression 

• , Xs)  < m]  „ 

If  it  were  the  case  that  Tg(x^s  Xg,  0 


for  Ta(Xls  X2, 


, xg)  = M for  ail  s and  all 


X]_  9 x2  ? 


x,„,  then  (3o?)  would  vanish  identically c However.,  we  are 


* aS5 

excluding  this  extraordinary  situation,  and  therefore  we  are  assured 
that  for  each  k there  is  at  least  one  set  H^(x-^,  x2,  0 0 0 , x^-^)  which  is 
not  empty o 

Now  let  us  take  note  of  our  information  scheme.  If,  at  the  (k=l)th 
stage,  the  knowledge  of  the  values  of  f on  Ak°(x]_s  x^,,  , x^=^)  could 

in  itself  improve  upon  the  number  T^-^(x^s  x2,  0 0 0 , x^=^)s  then  in  the 
information  at  that  stage  we  should  have  put,  not  Tk=^(x^,  Xg,  0 0 • , x^,  ) 

I 

but  the  improved  number 0 In  other  words,  it  is  implicit  in  the  informa- 
tion scheme  that  if  the  k-th  realized  point,  x^,  is,  as  a point  of  D,  a 
duplicate  of  one  of  the  points  in  A^°(x^,  x2,  0 • ° , x^_^)  then 


Tk(x15  Xg*  , 


V 


Tk-fxl»  x2: 


, x^-^)  — that  is,  the  T -value 


' 
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does  not  increase  „ In  particular  5 there  fore  9 if 

\"l^xls  x23  °°°  5 Xk“l^  < ^len  for  eac^  x e Ak°^Xl5 
we  have  T^(x^,  Xg3  ° « ° s xk«,]_3  x)  <M*  Otherwise  expressed: 

r 


(3o8) 


if 1 ^ ^k~l^xl 3 x2 


3 3 


x^)  5 then 


AkU(X13  X23  0C°  S Xk„X)  & %( 


X*|  ^ Xp  1 


3 Xk_]_^ 


Combining  this  with  (3e6)  we  get; 

if  x23  °'°  3 xk-i^  ^ A 3 then 


^ ^x]_s  x23  °°°  3 xk“l^  — ^k^xl3  x2 9 


From  (3«,2)  and  (3  ®9)  we  get  the  following  inequalities: 


Vi> 


P 


h ? v 


X13  X23  °°°  3 ^“1 


k - 2S  3, 


iji-'lj  2,  •• 0 , k - : 


If s now3  we  set 


p|xi£  H^J  = ai 


and  apply  (3®10)  to  (3.?)s  we  obtain  the  following  result: 


{ts(x 


Is  X23 


Xs)  < m]  » JT  a 


(3.11) 

And  therefore,, 

(3.12)  P {ts(X1?  X2?  ...  , Xs)  - m]  * 1 


1 


3=1 


3 xk=1) 


} ~ \ > 


I d 
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If  there  is  strong  convergence  of  U to  M„  then  the  left-hand 

side  of  (3 «X2)  must  tend  to  1 as  s ro0  Consequently  we  must  have 
s 

"fj  a.  0 (by  our  assumption  of  a non-trivial  problem  5 a,  > 0)  0 We 
j-2  0 1 

state  this  result  in 

THEOREM  2 „ A necessary  condition  that  the  s toe  has  tic  process 
give  strong  (or  weak;,  since  the  two  are  equivalent  by  virtue  of  ( 1 „2 ) 
convergence  of  to  M is  that  (cf„  ( 3 02) ) 

s 

(3,13)  lim  Tf  a.  = 0 o 

s~^  co  j=2  J 

The  inequalities  (3 .12 ) hold  for  J „ 

It  is  readily  seen  that  the  relations 

(3 .111)  ^ 1 - q±  , i ~ 2 s 3S  000 

hold^  so  that  the  divergence  of  2 c^  implies  (3.13) « 

This  theorem  is  in  the  nature  of  an  exact  verification  of  the 
first  motivation  discussed  in  section  2„  It  says,,  roughly 9 that  we 
must  not  consisteritly  allot  too  much  conditional  probability  to  the 
points  at  which  the  value  of  f is  alre  ac£y  knowns  if  we  are  to  have 
stochastic  convergence  of  3T  to  Mo 

By  virtue  of  (2,11),  the  inequalities  (3.12)  and  the  theorem 
hold  a fortiori  if  T is  replaced  by  T0o 


m 


■ 
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he  CONCERNING  SHARPENING  OF  THE  RESULTS 

The  results  we  have  obtained  in  section  2 are  quite  obviously 
obtained  actually  for  the  estimator  ^T.,0,  s-l?2?  °"°|s  and  come  to 
bear  upon  *£ts5  s = 15  2,  *taj  through  (2,12),  This  means  that  those 
results  could  have  been  obtained  even  if  we  had  not  introduced  the 
hypothesis  of  additional  k-th  stage  information  embodied  in  the  generally 
conceived  sets  Xg,  0 6 0 s x^)  and  B^,+-(x15  °°°  ? 

if  we  had  assumed  V(xi>  Xgj  000  5 x^.)  as  the  only  k-th  stage  informa- 
tion, And  the  results  in  section  3 obtain  by  the  grace  of  (3*9)  5 which  is 
a completely  general  relation,,  therefore  holding  also  in  the  special 
case  where  c)^°(x^s  X23  000  5 x^)  is  the  only  information. 

One  of  the  directions  for  investigation  after  bounds  which  will 
be  improvements  on  (2,2^),  (2  02^)s  (2,26)3  and  (3,12)3  is  in  making 
additional  assumptions  concerning  the  information  scheme.  For  example, 
if 1 we  specialize  to  the  situation  that  D is  a rectangular  lattice  in 
the  plane, 

D - [ (mh,  nh)  | m,  n = ls  2, , ° ° 0 ' , r j 5 


and  that  the  computer  knows  f to  have  the  property 


|f(rnh,  nh)  - f(m5hs  n'h)  j < ^ 


for  m*  = m t 1,  n*  - n ± 1 , 


where  °(  is  a fixed  number,  then  the  following  can  be  said:  if 


x.(j  = 1,  2,  000  , k)  is  such  that  f (x  „)  + & - 


max 
i+1,2,  * 0 *k 


f (x. ) 
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then  the  four  points  x„  * (h„  h)  s x„  + (-h,  h)  9 x„  + (h5  ~h)5  and 

<D  <D  0 

Xj  + (~h5  -h)  all  belong  to  A^^Cx^ 

Such  additional  facts  concerning  the  sets  A^+^,  311(1  iFk+l 

would  enable  us  in  general  to  improve  the  inclusion  relation  (3  09) | 
that  is  s enable  us  to  assert  a larger  set  than  A^°  included,  in 
In  this  way  we  could  obtain  smaller  upoer  bounds  for 
P X29  °00'  3 Xg)  ~ m}  than  given  in  (3d2)  0 

It  is  easy  to  see  that  we  always  have: 


(lid) 


Hk(xls  Xg,  000  , xk_1) c K o 


This  relation  can  be  used  with  (3  0 7)  to  obtain  lower  bounds  for  the 
probabilities  P {Tg(X^5  X23  ° s Xg)  - M | „ And  these  lower  bounds 
are  precisely  (2  02k)  s (2  02£)s  and  (2026)  0 Mow,  additional  facts 
concerning  the  sets  A^+^s  and  F.^  would  in  general  enable  us 

also  to  improve  (lul),  and  thereby,,  through  (3»7)  to  obtain  better 
bounds  than  given  in  (2  02i|)3  (2  02E>)5  and  (2  06)  0 

Of  course 5 such  improved  bounds  would  in  turn  give  us  less 
stringent  sufficient  conditions  for  convergence  of  J"  to  M and  more 
complete  necessary  conditions  „ 

But  even  without  making  additional  assumptions  concerning  the 
information  scheme,  there  is  a refinement  that  can  be  made  to  im- 
prove our  results 0 This  is  to  introduce  more  control  parameters 
than  the  b^  and  namely,,  different  parameters  for  different 


' 


©8  -e  -rii 


- 
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values  of  /?(x^s  x2?  °°°  , x^.)  s X (x^5  x 2,  ° ° ° , x^) , and 
X(x^3  x23  ° ° » 3 x^)  (-  number  of  points  in  x^  8 8 ° > x^)) 

as  well  as  for  different  stages  k„  This  would  call  for  more  refined 
calculations  with  (2„17)  and  (3 o7) « 
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